Incorporating LR Parsing into SPHINX
Abstract
This paper describes the integration of an LR natural language parser with the SPHINX speech recognition system. SPHINX is one of the most successful speech recognition systems in use today. Although it attains high word accuracy, SPHINX often outputs ungrammatical recognition results because the baseline SPHINX system uses very simple word-pair or bigram language models. For applications of spoken-language processing, sentence accuracy is also important. We have incorporated LR parsing into SPHINX to provide more global grammatical constraints. However, when using a tight grammar in a left-to-right continuous speech recognizer, once things begin to go wrong, they tend to go completely wrong. We also propose a solution to this problem by combining a context-free grammar with a bigram grammar.
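The limitation of word-pair models noted in the abstract can be illustrated with a minimal sketch. It is not SPHINX code and does not implement an LR parser; the toy sentences and the function names `word_pairs` and `accepted_by_word_pairs` are assumptions made for illustration only. A word-pair grammar checks only adjacent words, so it can accept a sentence whose subject and verb disagree across a longer span.

```python
def word_pairs(sentences):
    """Collect every adjacent word pair, including sentence boundary markers."""
    pairs = set()
    for sentence in sentences:
        words = ["<s>"] + sentence.split() + ["</s>"]
        pairs.update(zip(words, words[1:]))
    return pairs


def accepted_by_word_pairs(sentence, pairs):
    """Return True if every adjacent pair in the sentence is an allowed pair."""
    words = ["<s>"] + sentence.split() + ["</s>"]
    return all(pair in pairs for pair in zip(words, words[1:]))


# Hypothetical toy corpus (not taken from the paper).
TOY_SENTENCES = [
    "the ship that sails is fast",
    "the ships that sail are fast",
]
PAIRS = word_pairs(TOY_SENTENCES)

# Every adjacent pair below was seen in the corpus, so the word-pair model
# accepts the sentence even though "ship ... are" violates number agreement.
UNGRAMMATICAL = "the ship that sail are fast"
print(accepted_by_word_pairs(UNGRAMMATICAL, PAIRS))
```

A context-free grammar that encodes number agreement would reject the last sentence, which is the kind of global constraint the abstract attributes to LR parsing.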
Related resources
A Method of Incorporating Bigram Constraints into an LR Table and Its Effectiveness in Natural Language Processing
In this paper, we propose a method for constructing bigram LR tables by way of incorporating bigram constraints into an LR table. Using a bigram LR table, it is possible for a GLR parser to make use of both bigram and CFG constraints in natural language processing. Applying bigram LR tables to our GLR method has the following advantages: (1) Language models utilizing bigram LR tables have lower...
Interactive Speech Understanding
This paper introduces a robust interactive method for speech understanding. The generalized LR parsing is enhanced in this approach. Parsing proceeds from left to right correcting minor errors. When a very noisy portion is detected, the parser skips that portion using a fake nonterminal symbol. The unidentified portion is resolved by re-utterance of that portion which is parsed very ...
Minimum Distance Error Correction
A method is presented for incorporating error correction using a minimum distance measure into LR parsers. The method is suitable for use by an automatic parser-generator. State information in the LR parser stack at the point of detection of error is used to generate a set of strings which are potential repairs to the input. A string with least minimum distance from the actual input is chosen a...
Unifying LL and LR parsing
In parsing theory, LL parsing and LR parsing are regarded as two distinct methods. In this paper the relation between these methods is clarified. As shown in the literature on parsing theory, for every context-free grammar, a so-called non-deterministic LR(0) automaton can be constructed. Here we show that traversing this automaton in a special way is equivalent to LL(1) parsing. This automato...